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Cxi) sequence description: seq id no:1: 
cccaaaaaag ataaaataaa aacaaaacaa aacaaaagta ctaacaaatt attgaaactt 60 
ttaattttta ataaagaatc agtagatcta ttgttaaaag aaatgaactc aactccaagt 120 

I 



AAATTATTAC 


CGATAGATAA 


ACATTCTCAT 


TTACAATTAC 


AGCCTCAATC 


GTCCTCGGCA 

VJ I V_ V— I V_ VJ VJ V— 


180 


TCAATA 1 1 1 A 


ATTCCCCAAC 


AAAACCATTG 


AATTTCCCCA 


GAACAAATTC 


CAAGCCGAGT 


240 


TTAGATCCAA 


ATTCAAGCTC 


TGATACCTAC 


ACTAGCGAAC 


AAGATCAAGA 


GAAAGGGAAA 


300 


GAAGAGAAAA 


AGGACACAGC 


CTTTCAAACA 


TCI 1 1 IGATA 


GAAAI 1 1 IGA 


TCTTGATAAT 

1 V_* 1 1 VJ-/ % 1 1 \J \ l 


360 


TCAATCGATA 


TACAACAAAC 


AATTCAACAT 


CAGCAACAAC 


AGCCACAACA 


ACAACAACAA 


420 


CTCTCACAAA 


CCGACAATAA 


TTTAATTGAT 


GAA 1 1 1 1 C 1 1 


TTCAAACACC 

i i v_y w \j \ v_/ v v_ v_* 


GATGACTTCG 

VJ I VJ/V V_ 1 1 V_ VJ 


480 


ACTTTAGACC 


TAACCAAGCA 


AAATCCAACT 


GTGGACAAAG 


TGAATGAAAA 

1 VJ/vr\ 1 VJ/V/v^Vr^ 


TCATGCACCA 

1 V_AA 1 VJ V_/A V_ V_ AA 


540 


ACTTATATAA 


ATACCTCCCC 


CAACAAATCA 


ATAATGAAAA 


AGGCAACTCC 


TAAAGCGTCA 

1 * \t \i \ VJ V— VJ 1 V-^\ 


600 


CCTAAAAAAG 


TTGCATTTAC 


TGTAACTAAT 


CCCGAAATTC 


ATCATTATCC 


AGATAATAGA 


660 


GTCGAGGAAG 


AAGATCAAAG 


TCAACAAAAA 


GAAGATTCAG 


TTGAGCCACC 


CTTAATACAA 


720 


CATCAATGGA 


AAGATCCTTC 


TCAATTCAAT 


TATTCTGATG 


AAGATACAAA 


TGCTTCAGTT 


780 


CCACCAACAC 


CACCACTTCA 


TACGACGAAA 


CCTAC 1 1 1 1 G 


CGCAATTATT 

x_.VJ v-nn 1 1 f \ 1 1 


GAACAAAAAC 

VJ/\/\ V/Ar^rArAr^ V_ 


840 


AACGAAGTCA 


ATCTGGAACC 


AGAGGCATTG 

t \ VJ/ \ VJ VJ V/i 1 1 VJ 


ACAGATATGA 

n\-Tivr\ i r\ i vjaa 


AATTAAAGCG 

AAAA 1 1 AAAAAAVJ V- Vj 


CGAAAA 1 I If 

V_Vj AAAA AAAA 1 I I v_ 


Zf\J\J 


AGCAATTTAT 


CATTAGATGA 


AAAAGTCAAT 


TTATATCTTA 


CTFTCrACTAA 

VJ 1 V_V_.V_.AAv. 1 AAAA 


TAATAArAAT 

I AAAA 1 AAAA v_ AAAA I 




AGTAAGAATG 


TGTCAGATAT 


GGATCTGCAT 


TTACAAAACT 


TGCAAGACGC 


TTCGAAAAAC 

1 1 V_ VJAA/A/AAA/A \_ 


1070 

JLV/__ V/ 


AAAACTAATG 


AAAATATTCA 


CAA 1 1 1 G 1 CA 


TTTGCI I IAA 


AAGCACCAAA 

t \J \ VJ V-/ V V— V—/ W V/^ 


GAATGATATT 

vj/ i/ \ i vj/^\ i i i 


1080 


GAAAACCCAT 


TAAACTCATT 


GACTAACGCA 


GATATTCTGT 


TAAGATCATC 


TGGATCATCA 

1 VJ VJAA 1 V_/ \ f V_,AA 


1140 


CAATCGTCAT 


TACAATCI 1 1 


GAGGAATGAC 


AATCGTGTCT 

AAAA 1 V— VJ 1 VJ 1 V — 1 


TGGAATCAGT 

I VJ VJAAAA 1 V_AAVJ 1 


GCCTGG&TC A 

Vj V_ V_ 1 V3 VJVJ f V_AA 


1 700 

-L__ v/v/ 


CCTAAGAAGG 


TTAATCCTGG 

I 1 \ 1 I VJ VJ 


A 1 1 G 1 C 1 1 1 G 

r\ i i vj i v.. i i i vj 


AATGACGGCA 

/™\AA 1 VJrtV_.vJVJV_.AA 


TAAAGGGGTT 

1 AAAAAA VjVjVjVj I 1 


CTCTG ATG A f_ 

V_ 1 V 1 VJAA 1 VJAAVJ 


-L_- UU 


GTTGTTGAAT 


CATTACTTCC 


TCGTGACTTA 


TCTCGAGACA 

1 V_ 1 V_ VJ AA VJ AA V, AA 


AATTAGAGAC 

AAAA 1 t AAVJAAVJAA V_ 


TArAAAAGAA 

1 AA V_. AAAA AAAA V3 AAAA 


1 370 


CATGATGCAC 


CAGAACACAA 


CAATGAGAAT 


TTTATTGATG 

III t \ 1 1 \JI V 1 VJ 


CTAAATCGAC 

V, I rUA/A 1 V-VI/AV, 


TAATACCAAT 

1 nn 1 AAV, V, AAAA 1 


1380 


AAGGGACAAC 


TCTTAGTATC 


ATCTGATGAT 


CATTTGGACT 


CI 1 1 IGATAG 

V^ Kill VJ/^> 1 / \ VJ 


ATCCTATAAC 

f \ 1 VV 1 r\ ■ j/^jr^ \___. 


1440 


CACACTGAAC 


AATCAAI 1 1 1 


GAATCI IMG 


AATAGTGCAT 

* u V 1 f \ VJ 1 VJ V-/^ 1 


CACAATCTCA 

\_nVJA/A I V_ 1 V_/\ 


AATTTCGTTA 

AAAA III V_ VJ 1 1 AA 


1500 


AATGCATTGG 


AAAAACAAAG 


GCAAACACAG 


GAACAAGAAC 

VJ/\/\ V—AA/A VJ/A/\ \_ 


AAACACAAGC 

AAAAAA V_AAV_~AAAAVJV-. 


VJVJ V_AA VJAAVJ \_,V_ 1 


1 5fi0 


GAAGAAGAAA 


CTTCGTTTAG 


TGATAATATC 


AAAGTTAAAC 

rl/TMVJ I I AAAAAA V_ 


AAGAGCCAAA 

/A/AVjAAVjV_.V_. AAAAAA 


GAGCAATTTG 

VJAAVJ V_ AAAA I 1 1 VJ 


1 fi70 


GAG 1 1 1 G I CA 

VJjV VJ III VJ » V^J^A 


AGGTTACCAT 

/VVJ VJ 1 1 aav_v_,/-\ 1 


caagaaagaa 

V_AAAA V3MMM VjV-\/-\ 


CC ACTTTCTCC 

V_V_AAVJ 1 1 V_ t VJV_ 


CCACCiCZ A A AT 

V_V_MV_.VJVJ AAAAAA 1 


AAA A(Z( — VCC A 

MMaaAAVJV- 1 V_V_AA 


jlDOU 


AAAAGAGAAT 


TTTCAAGTCG 


AATATTAAGA 


ATAAAAAATG 

r\ 1 AAAAAA AAAAAA 1 VJ 


A AGATHA A AT 

AAAA VJAA 1 V3 AAAAAA 1 


TdCCCAACCA 

1 VJV-.V_VJMAAV_.V_.AA 


JL / *+v/ 


GCTGATATTC 


ATCCTAAAAA 


AGAAAATGAA 

fA\3rir\rv^ l VJ/AAA 


GCAAArAfvrr 

VJ V_ AAAAAA V_AAVJ 1 V- 


ATGTrf;AAf;A 

AA 1 \J I V, VJAAAA VJAA 


T A CTC* A TG C A 

1 AAV- 1 VJAA 1 VJV_AA 


-LOUu 


TTGTTGAAGA 


AAGCACTTAA 


TGATGATGAG 


GAATCTGACA 


CGACCCAAAA 


CTCAACGAAA 


1860 


ATGTCAATTC 


Gilt ICATAT 


TGATAGTGAT 


TGGAAATTGG 


AAGACAGTAA 


TGATGGCGAT 


1920 


AGAGAAGATA 


ATGATGATAT 


TTCTCG 1 1 1 1 


GAGAAATCAG 


ATA 1 1 1 1 GAA 


CGACGTATCA 


1980 


CAGACTTCTG 


ATATTATTGG 


TGACAAATAT 


GGAAACTCAT 


CAAGTGAAAT 


AACCACCAAA 


2040 
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ACATTAGCAC 


CCCCAAGATC 


GGACAACAAT 


GACAAGGAGA 


ATTCTAAATC 


TTTGGAAGAT 


2100 


CCAGCTAATA 


ATGAATCATT 


GCAACAACAA 


TTGGAGGTAC 


CGCATACAAA 


AGAAGATGAT 


2160 


AGCA 1 1 1 1 AG 


CCAACTCGTC 


CAATATTGCT 


CCACCTGAAG 


AATTGACTTT 


GCCCGTAGTG 


2220 


GAAGCAAATG 


ATTATTCATC 


1 1 1 IAATGAC 


GTGACCAAAA 


CTTTTGATGC 


ATACTCAAGC 


2280 


TTTGAAGAGT 


CATTATCTAG 


AGAGCACGAA 


ACTGATTCAA 


AACCAATTAA 


TTTCATATCA 


2340 


ATTTGGCATA 


AACAAGAAAA 


GCAGAAGAAA 


CATCAAATTC 


ATAAAGTTCC 


AACTAAACAG 


2400 


ATCATTGCTA 


GTTATCAACA 


ATACAAAAAC 


GAACAAGAAT 


CTCGTGTTAC 


TAGTGATAAA 


2460 


GTGAAAATCC 


CAAATGCCAT 


ACAATTCAAG 


AAATTCAAAG 


AGGTAAATGT 


CATGTCAAGA 


2520 


AGAGTTGTTA 


GTCCAGACAT 


GGATGATTTG 


AATGTATCTC 


AAl 1 1 1 IACC 


AGAATTATCT 


2580 


GAAGACTCTG 


GATTTAAAGA 


TTTGAAI 1 1 1 


GCCAACTACT 


CCAATAACAC 


CAACAGACCA 


2640 


AGAAGI MIA 


CTCCATTGAG 


CACTAAAAAT 


GTCTTGTCGA 


ATATTGATAA 


CGATCCTAAT 


2700 


GTTGTTGAAC 


CTCCTGAACC 


GAAATCATAT 


GCTGAAATTA 


GAAATGCTAG 


ACGGTTATCA 


2760 


GCTAATAAGG 


CAGCGCCAAA 


TCAGGCACCA 


CCATTGCCAC 


CACAACGACA 


ACCATCTTCA 


2820 


ACTCGTTCCA 


ATTCAAATAA 


ACGAGTGTCC 


AGATTTAGAG 


TGCCCACATT 


TGAAATTAGA 


2880 


AGAACTTCTT 


CAGCATTAGC 


ACCTTGTGAC 


ATGTATAATG 


ATA 1 II MGA 


TGATTTCGGT 


2940 


GCGGGTTCTA 


AACCAACTAT 


AAAGGCAGAA 


GGAATGAAAA 


CATTGCCAAG 


TATGGATAAA 


3000 


GATGATGTCA 


AGAGGA Mil 


GAATGCAAAG 


AAAGGTGTGA 


CTCAAGATGA 


ATATATAAAT 


3060 


GCCAAACTTG 


TTGATCAAAA 


ACCTAAAAAG 


AATTCAATTG 


TCACCGATCC 


CGAAGACCGA 


3120 


TATGAAGAAT 


TACAACAAAC 


TGCCTCTATA 


CACAATGCCA 


CCATTGATTC 


AAGTATTTAT 


3180 


GGCCGACCAG 


ACTCCATTTC 


TACCGACATG 


TTGCCTTATC 


TTAGTGATGA 


ATTGAAAAAA 


3240 


CCACCTACGG 


CTTTATTATC 


TGCTGATCGT 


TTGTTTATGG 


AACAAGAAGT 


ACATCCGTTA 


3300 


AGATCAAACT 


CTGTTTTGGT 


TCACCCAGGG 


GCAGGAGCAG 


CAACTAATTC 


TTCAATGTTA 


3360 


CCAGAGCCAG 


AMI IGAATT 


AATCAATTCA 


CCTGCTAGAA 


ATGTGCTGAA 


CAACAGTGAT 


3420 


AATGTCGCCA 


TCAGTGGTAA 


TGCTAGTACT 


ATTAGI 1 1 IA 


ACCAATTGGA 


TATGAAI 1 1 1 


3480 


GATGACCAAG 


CTACAATTGG 


TCAAAAAATC 


CAAGAGCAAC 


CTGCTTCAAA 


ATCCGCCAAT 


3540 


ACTGTTCGTG 


GTGATGATGA 


TGGATTGGCC 


AGTGCACCTG 


AAACACCAAG 


AACTCCTACC 


3600 


AAAAAGGAGT 


CCATATCAAG 


CAAGCCTGCC 


AAGC 1 II L II 


CTGCCTCCCC 


TAGAAAATCA 


3660 


CCAATTAAGA 


TTGGTTCACC 


AGTTCGAGTT 


ATTAAGAAAA 


ATGGATCAAT 


TGCTGGCATT 


3720 


GAACCAATCC 


**** A A A A ^ y~ A f- 

CAAAAGCCAC 


TCACAAACCG 


A A A A A T/~ A ~r 

AAGAAATCAT 


Tf~T~ A A f~ (~ AAA 

TCCAAGGAAA 


rr~ a r An — i — rt~ a 
LCjAQjA 1 1 1 v_A 


J / OU 


AACCATAAAG 


TACGAGATGG 


TGGAATTTCA 


CCAAGCTCCG 


GATCAGAGCA 


TCAACAGCAT 


3840 


AATCCTAGTA 


TGGTTTCTGT 


TCCTTCACAG 


TATACTGATG 


CTACTTCAAC 


GGTTCCAGAT 


3900 


GAAAACAAAG 


ATGTTCAACA 


CAAGCCTCGT 


GAAAAGCAAA 


AGCAAAAGCA 


TCACCATCGC 


3960 
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CATCATCATC 


ATCATCATAA 


ACAAAAAACT 


GATATTCCGG 


GTGTTGTTGA 


TGATGAAATT 


4020 


CCTGATGTAG 


GATTACAAGA 


ACGAGGCAAA 


TTATTCTTTA 


GAG MM AGG 


AATTAAGAAT 


4080 


ATCAATTTAC 


CCGATATTAA 


TACTCACAAA 


GGAAGATTCA 


CTTTAACGTT 


GGATAATGGA 


4140 


GTGCATTGTG 


TTACTACACC 


AGAATACAAC 


ATGGACGACC 


ATAATGTTGC 


CATAGGTAAA 


4200 


GAATTTGAGT 


TGACAGTTGC 


TGATTCATTA 


GAG 1 1 IATTT 


TAACTTTGAA 


GGCATCATAT 


4260 


GAAAAACCTC 


GTGGTACATT 


AGTAGAAGTG 


ACTGAAAAGA 


AAGTTGTCAA 


ATCAAGAAAT 


4320 


AGATTGAGTC 


GATTA II 1 GG 


ATCGAAAGAT 


ATTATCACCA 


CGACAAAGTT 


TGTGCCCACT 


4380 


GAAGTCAAAG 


ATACCTGGGC 


TAATAAG 1 1 1 


GCTCCTGATG 


GTTCATTTGC 


TAGATGTTAC 


4440 


ATTGATTTAC 


AACAATTTGA 


AGACCAAATC 


ACCGGTAAAG 


CATCACAGTT 


TGATCTCAAT 


4500 


TGTTTTAATG 


AATGGGAAAC 


TATGAGTAAT 


GGCAATCAAC 


CAATGAAAAG 


AGGCAAACCT 


4560 


TATAAGATTG 


CTCAATTGGA 


AGTTAAAATG 


TTGTATGTTC 


CACGATCAGA 


TCCAAGAGAA 


4620 


ATATTACCAA 


CCAGCATTAG 


ATCCGCATAT 


GAAAGCATCA 


ATGAATTAAA 


CAATGAACAG 


4680 


AATAATTACT 


TTGAAGGTTA 


TTTACATCAA 


GAAGGAGGTG 


ATTGTCCAAT 


1 1 1 IAAGAAA 


4740 


CG 1 1 1 It I CA 


AATTAATGGG 


CAC 1 1 CI 1 1 A 


TTGGCTCATA 


GTGAAATATC 


TCATAAAACT 


4800 


AGAGCCAAAA 


TTAATTTATC 


AAAAGTTGTT 


GATTTGATTT 


ATGTTGATAA 


AGAAAACATT 


4860 


GATCGTTCCA 


ATCATCGAAA 


TTTCAGTGAT 


GTGTTATTGT 


TGGATCATGC 


ATTCAAAATC 


4920 


AAATTTGCTA 


ATGGTGAGTT 


GATTGAI 1 i 1 


TGTGCTCCTA 


ATAAACATGA 


AATGAAAATA 


4980 


TGGATTCAAA 


ATTTACAAGA 


AATTATCTAT 


AGAAATCGGT 


TCAGACGTCA 


ACCATGGGTA 


5040 


AATTTGATGC 


TTCAACAACA 


ACAACAACAA 


CAACAACAAC 


AAAGCTCCCA 


ACAGTAATTG 


5100 


AAAGGTCTAC 


1 1 1 1GAI M 1 


TTTAAI 1 1 IA 


ATTGGCAAAT 


ATATGCCCAT 


1 1 1 G 1 ATT AT 


5160 


CI 1 1 IAGTCT 


AATAGCGTTT 


TC 1 M 1 1 1 1 C 


CAGT 






5194 


(2) INFORMATION FOR SEQ ID NO: 2: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1664 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) molecule type: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asn ser Thr Pro ser Lys Leu Leu Pro lie Asp Lys His Ser His 
15 10 15 

Leu Gin Leu Gin Pro Gin Ser Ser Ser Ala Ser lie Phe Asn Ser Pro 
20 25 30 

Thr Lys Pro Leu Asn Phe Pro Arg Thr Asn Ser Lys Pro Ser Leu Asp 



35 



40 



45 



pro Asn ser ser Ser Asp Thr Tyr Thr ser Glu Gin Asp Gin Glu Lys 
50 55 60 

Gly Lys Glu Glu Lys Lys Asp Thr Ala Phe Gin Thr Ser Phe Asp Arg 
65 70 75 80 

Asn Phe Asp Leu Asp Asn ser lie Asp lie Gin Gin Thr lie Gin His 
85 90 95 

Gin Gin Gin Gin Pro Gin Gin Gin Gin Gin Leu ser Gin Thr Asp Asn 
100 105 110 

Asn Leu lie Asp Glu Phe Ser Phe Gin Thr Pro Met Thr Ser Thr Leu 
115 120 125 

Asp Leu Thr Lys Gin Asn Pro Thr val Asp Lys Val Asn Glu Asn His 
130 135 140 

Ala Pro Thr Tyr lie Asn Thr Ser Pro Asn Lys ser lie Met Lys Lys 
145 150 155 160 

Ala Thr Pro Lys Ala Ser Pro Lys Lys Val Ala Phe Thr val Thr Asn 
165 170 175 

Pro Glu lie His His Tyr Pro Asp Asn Arg val Glu Glu Glu Asp Gin 
180 185 190 

Ser Gin Gin Lys Glu Asp Ser Val Glu Pro Pro Leu lie Gin His Gin 
195 200 205 

Trp Lys Asp Pro ser Gin Phe Asn Tyr Ser Asp Glu Asp Thr Asn Ala 
210 215 220 

Ser val Pro Pro Thr Pro Pro Leu His Thr Thr Lys Pro Thr Phe Ala 
225 230 235 240 

Gin Leu Leu Asn Lys Asn Asn Glu Val Asn Ser Glu Pro Glu Ala Leu 
245 250 255 

Thr Asp Met Lys Leu Lys Arg Glu Asn Phe Ser Asn Leu ser Leu Asp 
260 265 270 

Glu Lys Val Asn Leu Tyr Leu Ser Pro Thr Asn Asn Asn Asn ser Lys 
275 280 285 

Asn Val Ser Asp Met Asp Ser His Leu Gin Asn Leu Gin Asp Ala Ser 
290 295 300 

Lys Asn Lys Thr Asn Glu Asn lie His Asn Leu ser Phe Ala Leu Lys 
305 310 315 320 

Ala Pro Lys Asn Asp lie Glu Asn Pro Leu Asn ser Leu Thr Asn Ala 
325 330 335 

Asp lie Ser Leu Arg Ser Ser Gly Ser Ser Gin ser ser Leu Gin Ser 
340 345 350 

Leu Arq Asn Asp Asn Arg val Leu Glu Ser val Pro Gly ser Pro Lys 
355 360 365 

Lys val Asn Pro Gly Leu ser Leu Asn Asp Gly lie Lys Gly Phe ser 
370 375 380 
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Asp Glu val val Glu Ser Leu Leu Pro Arg Asp Leu Ser Arg Asp Lys 
385 390 395 400 

Leu Glu Thr Thr Lys Glu His Asp Ala Pro Glu His Asn Asn Glu Asn 
405 410 415 

Phe lie Asp Ala Lys Ser Thr Asn Thr Asn Lys Gly Gin Leu Leu Val 
420 425 430 

Ser Ser Asp Asp His Leu Asp ser Phe Asp Arg ser Tyr Asn His Thr 
435 440 445 

Glu Gin Ser lie Leu Asn Leu Leu Asn ser Ala ser Gin ser Gin lie 
450 455 460 

Ser Leu Asn Ala Leu Glu Lys Gin Arg Gin Thr Gin Glu Gin Glu Gin 
465 470 475 480 

Thr Gin Ala Ala Glu Pro Glu Glu Glu Thr ser Phe ser Asp Asn lie 
485 490 495 

Lvs val Lys Gin Glu Pro Lys Ser Asn Leu Glu Phe Val Lys val Thr 
500 505 510 

lie Lvs Lys Glu Pro val ser Ala Thr Glu lie Lys Ala Pro Lys Arg 
515 520 525 

Glu Phe ser Ser Arg He Leu Arg lie Lys Asn Glu Asp Glu He Ala 
530 535 540 

Glu Pro Ala Asp lie His Pro Lys Lys Glu Asn Glu Ala Asn Ser His 
545 550 555 560 

Val Glu Asp Thr Asp Ala Leu Leu Lys Lys Ala Leu Asn Asp Asp Glu 
565 570 575 

Glu Ser Asp Thr Thr Gin Asn Ser Thr Lys Met Ser lie Arg Phe His 
580 585 590 

He Asp ser Asp Trp Lys Leu Glu Asp ser Asn Asp Gly Asp Arg Glu 
595 600 605 

Asp Asn Asp Asp lie Ser Arg Phe Glu Lys Ser Asp lie Leu Asn Asp 
610 615 620 

val ser Gin Thr ser Asp lie lie Gly Asp Lys Tyr Gly Asn Ser Ser 
625 630 635 640 

Ser Glu lie Thr Thr Lys Thr Leu Ala Pro Pro Arg Ser Asp Asn Asn 
645 650 655 

asp Lys Glu Asn Ser Lys Ser Leu Glu Asp Pro Ala Asn Asn Glu Ser 
660 665 670 

Leu Gin Gin Gin Leu Glu val Pro His Thr Lys Glu Asp Asp ser lie 
675 680 685 

Leu Ala Asn Ser Ser Asn lie Ala Pro Pro Glu Glu Leu Thr Leu Pro 
690 695 700 

val val Glu Ala Asn Asp Tyr Ser Ser Phe Asn Asp Val Thr Lys Thr 
705 710 715 720 



Phe Asp Ala Tyr ser ser Phe Glu Glu ser Leu Ser Arg Glu His Glu 
725 730 735 

Thr Asp Ser Lys Pro lie Asn Phe lie ser lie Trp His Lys Gin Glu 
740 745 750 

Lys Gin Lys Lys His Gin lie His Lys val Pro Thr Lys Gin lie lie 
755 760 765 

Ala ser Tyr Gin Gin Tyr Lys Asn Glu Gin Glu ser Arg val Thr Ser 
770 775 780 

Asp Lys Val Lys lie Pro Asn Ala lie Gin Phe Lys Lys Phe Lys Glu 
785 790 795 800 

Val Asn val Met Ser Arg Arg val val ser Pro Asp Met Asp Asp Leu 
805 810 815 

Asn val Ser Gin Phe Leu Pro Glu Leu ser Glu Asp Ser Gly Phe Lys 
820 825 830 

Asp Leu Asn Phe Ala Asn Tyr Ser Asn Asn Thr Asn Arg Pro Arg Ser 
835 840 845 

Phe Thr Pro Leu Ser Thr Lys Asn val Leu Ser Asn lie Asp Asn Asp 
850 855 860 

Pro Asn val val Glu Pro Pro Glu Pro Lys ser Tyr Ala Glu lie Arg 
865 870 875 880 

Asn Ala Arg Arg Leu Ser Ala Asn Lys Ala Ala Pro Asn Gin Ala Pro 
885 890 895 

Pro Leu Pro Pro Gin Arg Gin Pro Ser Ser Thr Arg ser Asn Ser Asn 
900 905 910 

Lys Arq val ser Arg Phe Arg val Pro Thr Phe Glu lie Arg Arg Thr 
915 920 925 

Ser Ser Ala Leu Ala Pro cys Asp Met Tyr Asn Asp lie Phe Asp Asp 
930 935 940 

Phe Gly Ala Gly Ser Lys Pro Thr lie Lys Ala Glu Gly Met Lys Thr 
945 950 955 960 

Leu Pro ser Met Asp Lys Asp Asp val Lys Arg lie Leu Asn Ala Lys 
965 970 975 

Lys Gly Val Thr Gin Asp Glu Tyr lie Asn Ala Lys Leu Val Asp Gin 
980 985 990 

Lys Pro Lys Lys Asn Ser lie val Thr Asp Pro Glu Asp Arg Tyr Glu 
995 1000 1005 

Glu Leu Gin Gin Thr Ala Ser lie His Asn Ala Thr lie Asp ser ser 
1010 1015 1020 

lie Tyr Gly Arg pro Asp Ser lie Ser Thr Asp Met Leu Pro Tyr Leu 
1025 1030 1035 1040 

Ser Asp Glu Leu Lys Lys Pro Pro Thr Ala Leu Leu Ser Ala Asp Arg 
1045 1050 1055 

Leu Phe Met Glu Gin Glu Val His Pro Leu Arg Ser Asn Ser val Leu 
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1060 1065 1070 

val His Pro Gly Ala Gly Ala Ala Thr Asn Ser Ser Met Leu Pro Glu 
1075 1080 1085 

pro Asp Phe Glu Leu lie Asn Ser Pro Ala Arg Asn Val ser Asn Asn 
1090 1095 1100 

Ser Asp Asn val Ala lie ser Gly Asn Ala Ser Thr lie Ser Phe Asn 
1105 1110 1115 1120 

Gin Leu Asp Met Asn Phe Asp Asp Gin Ala Thr lie Gly Gin Lys lie 
1125 1130 1135 

Gin Glu Gin Pro Ala Ser Lys Ser Ala Asn Thr val Arg Gly Asp Asp 
1140 1145 1150 

Asp Gly Leu Ala Ser Ala Pro Glu Thr Pro Arg Thr Pro Thr Lys Lys 
1155 1160 1165 

Glu ser lie Ser Ser Lys Pro Ala Lys Leu Ser Ser Ala Ser Pro Arg 
1170 1175 1180 

Lys Ser Pro lie Lys He Gly Ser Pro Val Arg val lie Lys Lys Asn 
1185 1190 1195 1200 

Gly ser lie Ala Gly He Glu Pro He Pro Lys Ala Thr His Lys Pro 
1205 1210 1215 

Lys Lys Ser Phe Gin Gly Asn Glu lie ser Asn His Lys val Arg Asp 
1220 1225 1230 

Gly Gly lie ser Pro Ser Ser Gly Ser Glu His Gin Gin His Asn Pro 
1235 1240 1245 

Ser Met val Ser Val Pro ser Gin Tyr Thr Asp Ala Thr ser Thr Val 
1250 1255 1260 

Pro Asp Glu Asn Lys Asp Val Gin His Lys Pro Arg Glu Lys Gin Lys 
1265 1270 1275 1280 

Gin Lys His His His Arg His His His His His His Lys Gin Lys Thr 
1285 1290 1295 

Asp lie Pro Gly val Val Asp Asp Glu lie Pro Asp Val Gly Leu Gin 
1300 1305 1310 

Glu Arg Gly Lys Leu Phe Phe Arg val Leu Gly lie Lys Asn lie Asn 
1315 1320 1325 

Leu Pro Asp lie Asn Thr His Lys Gly Arg Phe Thr Leu Thr Leu Asp 
1330 1335 1340 

Asn Gly val His Cys Val Thr Thr Pro Glu Tyr Asn Met Asp Asp His 
1345 1350 1355 1360 

Asn val Ala lie Gly Lys Glu Phe Glu Leu Thr val Ala Asp Ser Leu 
1365 1370 1375 

Glu Phe lie Leu Thr Leu Lys Ala ser Tyr Glu Lys Pro Arg Gly Thr 
1380 1385 1390 

Leu val Glu Val Thr Glu Lys Lys Val val Lys ser Arg Asn Arg Leu 
1395 1400 1405 
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Ser Arq Leu Phe Gly Ser Lys Asp lie He Thr Thr Thr Lys Phe Val 
1410 1415 1420 

pro Thr Glu Val Lys Asp Thr Trp Ala Asn Lys Phe Ala Pro Asp Gly 
1425 1430 1435 1440 

ser Phe Ala Arq Cys Tyr He Asp Leu Gin Gin Phe Glu Asp Gin He 
1445 1450 1455 

Thr Gly Lys Ala ser Gin Phe Asp Leu Asn Cys Phe Asn Glu Trp Glu 
1460 1465 1470 

Thr Met Ser Asn Gly Asn Gin Pro Met Lys Arg Gly Lys Pro Tyr Lys 
1475 1480 1485 

lie Ala Gin Leu Glu val Lys Met Leu Tyr Val Pro Arg Ser Asp Pro 
1490 1495 1500 

Arq Glu lie Leu Pro Thr ser He Arg Ser Ala Tyr Glu ser He Asn 
1505 1510 1515 1520 

Glu Leu Asn Asn Glu Gin Asn Asn Tyr Phe Glu Gly Tyr Leu His Gin 
1525 1530 1535 

Glu Gly Gly Asp Cys Pro lie Phe Lys Lys Arg Phe Phe Lys Leu Met 
1540 1545 1550 

Gly Thr ser Leu Leu Ala His Ser Glu lie Ser His Lys Thr Arg Ala 
1555 1560 1565 

Lvs lie Asn Leu Ser Lys Val Val Asp Leu lie Tyr Val Asp Lys Glu 
1570 1575 1580 

Asn lie Asp Arg ser Asn His Arg Asn Phe Ser Asp Val Leu Leu Leu 
1585 1590 1595 1600 

Asp His Ala Phe Lys lie Lys Phe Ala Asn Gly Glu Leu lie Asp Phe 
1605 1610 1615 

Cys Ala Pro Asn Lys His Glu Met Lys lie Trp lie Gin Asn Leu Gin 
y 1620 1625 1630 

Glu lie lie Tyr Arq Asn Arg Phe Arg Arg Gin Pro Trp Val Asn Leu 
1635 1640 1645 

Met Leu Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin ser Ser Gin Gin 
1650 1655 1660 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) feature^ information: amino acid positions 218-453 from SEQ ID NO: 2 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Asp Glu Asp Thr Asn Ala Ser val Pro Pro Thr Pro Pro Leu His 
15 10 15 

Thr Thr Lys Pro Thr Phe Ala Gin Leu Leu Asn Lys Asn Asn Glu Val 
20 25 30 

Asn Ser Glu Pro Glu Ala Leu Thr Asp Met Lys Leu Lys Arg Glu Asn 
35 40 45 

Phe Ser Asn Leu Ser Leu Asp Glu Lys Val Asn Leu Tyr Leu Ser Pro 
50 55 60 

Thr Asn Asn Asn Asn Ser Lys Asn Val Ser Asp Met Asp Ser His Leu 
65 70 75 80 

Gin Asn Leu Gin Asp Ala Ser Lys Asn Lys Thr Asn Glu Asn lie His 
85 90 95 

Asn Leu Ser Phe Ala Leu Lys Ala Pro Lys Asn Asp lie Glu Asn Pro 
100 105 110 

Leu Asn Ser Leu Thr Asn Ala Asp lie ser Leu Arg ser ser Gly ser 
115 120 125 

Ser Gin Ser Ser Leu Gin Ser Leu Arg Asn Asp Asn Arg Val Leu Glu 
130 135 140 

Ser val Pro Gly Ser Pro Lys Lys val Asn Pro Gly Leu Ser Leu Asn 
145 150 155 160 

Asp Gly lie Lys Gly Phe Ser Asp Glu Val val Glu Ser Leu Leu Pro 
165 170 175 

Arq Asp Leu Ser Arg Asp Lys Leu Glu Thr Thr Lys Glu His Asp Ala 
180 185 190 

□ Pro Glu His Asn Asn Glu Asn Phe lie Asp Ala Lys Ser Thr Asn Thr 
195 200 205 

Asn Lys Gly Gin Leu Leu val Ser Ser Asp Asp His Leu Asp Ser Phe 
210 215 220 

Asp Arg Ser Tyr Asn His Thr Glu Gin Ser lie Leu 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(c) strandedness: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Tyr Leu Ser Pro Thr Asn Asn Asn Asn Ser Lys Asn Val Ser Asp Met 
1 5 10 15 
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Asp Leu His Leu Gin Asn Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(c) strandedness: single 
(D) TOPOLOGY: linear 

(ii) molecule type: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Trp Lys Leu Glu Asp Ser Asn Asp Gly Asp Arg Glu Asp Asn Asp 
15 10 15 

Asp lie Ser Arg Phe Glu Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) type: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Lys Ser Ala Asn Thr val Arg Gly Asp Asp Asp Gly Leu Ala ser 
15 10 15 

Ala 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) type: amino acid 

(C) strandedness: single 

(D) TOPOLOGY: linear 

(ii) molecule type: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp His Leu Asp Ser Phe Asp Arg Ser Tyr Asn His Thr Glu Gin Ser 
15 10 15 
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lie 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Trp lie Gin Asn Leu Gin Glu lie lie Tyr Arg Asn Arg Phe Arg Arg 
15 10 15 

Gin 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: single 
(D) TOPOLOGY: linear 

(ii) molecule TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAATTCAATG CTACCCTCAA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCCGGGGGAC CCCCTTCACT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomi c) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
AARGTYGGWT TYTTYAAR 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: single 
(D) topology: linear 

(ii) molecule type: DNA (genomic) 



(xi) sequence description: seq id no: 12 

GAAATHGAYG AYTTRATG 
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